AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Speech-Text Dual Modality

# Speech-Text Dual Modality

Ichigo Llama3.1 S Instruct V0.3 Phase 3
Apache-2.0
One of the Ichigo-llama3s series models, focusing on improving the ability to handle ambiguous inputs and multi-turn dialogues, supporting both audio and text inputs.
Text-to-Audio English
I
Menlo
20
35
Ichigo Llama3.1 S Base V0.3
Apache-2.0
Llama3-S is a multimodal language model supporting both audio and text inputs, developed based on the Llama-3 architecture with a focus on enhancing speech understanding capabilities.
Audio-to-Text English
I
Menlo
18
4
Ichigo Llama3.1 S Base V0.3
Apache-2.0
The Llama3-S series model is a multimodal language model developed by Homebrew Research, natively supporting audio and text input comprehension, extending the speech understanding capability based on the Llama-3 architecture.
Audio-to-Text English
I
homebrewltd
33
4
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase